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Abstract — Energy-harvesting wireless sensor networking is an 
emerging technology with applications to various fields such as 
environmental and structural health monitoring. A distinguish- 
ing feature of wireless sensors is the need to perform both 
source coding tasks, such as measurement and compression, 
and transmission tasks. It is known that the overall energy 
consumption for source coding is generally comparable to that 
of transmission, and that a joint design of the two classes of 
tasks can lead to relevant performance gains. Moreover, the 
efficiency of source coding in a sensor network can be potentially 
improved via distributed techniques by leveraging the fact that 
signals measured by different nodes are correlated. 

In this paper, a data gathering protocol for multihop wireless 
sensor networks with energy harvesting capabilities is studied 
whereby the sources measured by the sensors are correlated. 
Both the energy consumptions of source coding and transmis- 
sion are modeled, and distributed source coding is assumed. 
The problem of dynamically and jointly optimizing the source 
coding and transmission strategies is formulated for time- 
varying channels and sources. The problem consists in the 
minimization of a cost function of the distortions in the source 
reconstructions at the sink under queue stability constraints. 
By adopting perturbation-based Lyapunov techniques, a close- 
to-optimal online scheme is proposed that has an explicit 
and controllable trade-off between optimality gap and queue 
sizes. The role of side information available at the sink is 
also discussed under the assumption that acquiring the side 
information entails an energy cost. It is shown that the presence 
of side information can improve the network performance both 
in terms of overall network cost function and queue sizes. 

Index Terms — Wireless Sensor Networks, Data Gathering, 
Energy Harvesting, Distributed Source Coding, Lyapunov Op- 
timization. 



I. Introduction 

Wireless sensor networks have found applications in a 
large number of fields such as environmental sensing and 
structural health monitoring [1]. In such applications, the 
maintenance necessary to replace the batteries when de- 
pleted is often of prohibitive complexity, if not impossible. 
Therefore, sensors that harvest energy from the environment, 
e.g., in the form of solar, thermal, vibrational or radio 
energy [2] [3], have been proposed and are now commercially 
available. 

* Department of Information Engineering, University of Padova, Italy. 
tcWCSPR, New Jersey Institute of Technology, New Jersey, USA. This 
work has been supported in part by the FP7 EU project "SWAP" G.A. no. 
251557. The work of O. Simeone was partially supported by US NSF under 
grant CCF-0914899. 



Given the interest outlined above, the problem of designing 
optimal transmission protocols for energy harvesting wireless 
sensor networks has recently received considerable attention. 
In the available body of work reviewed below in Section I-B, 
the only source of energy expenditure is the energy used 
for transmission. This includes, e.g., the energy used by 
the power amplifiers. However, a distinguishing feature of 
sensor networks is that the sensors have not only to carry 
out transmission tasks, but also sensing and source coding 
tasks, such as compression. The source coding tasks entail 
a non-negligible energy consumption. In fact, reference [4] 
demonstrates that the overall cost required for compression 1 
is comparable with that needed for transmission, and that a 
joint design of the two tasks can lead to significant energy 
saving gains. Another distinguishing feature of sensor net- 
works is that the efficiency of source coding can be improved 
via distributed source coding techniques (see, e.g., [5]) by 
leveraging the fact that sources measured by different sensors 
are generally correlated (see, e.g., [6]). 

A. Contributions 

In this paper, we focus on an energy-harvesting wireless 
sensor network and account for the energy costs of both 
source coding and transmission. Moreover, we assume that 
the sensors can perform distributed source coding to leverage 
the correlation of the sources measured at different sensors. 
A key motivation for enabling distributed source coding in 
energy-harvesting networks is that this enables sensors with 
correlated measurements to trade energy resources among 
them, to an extent determined by the amount of correlation. 
For instance, a sensor that is running low on energy can 
benefit from the energy potentially available at a nearby node 
if the latter has correlated measurements. This is because, 
through distributed source coding, the transmission require- 
ments on the first sensor are eased by the transmission of 
correlated information from the nearby sensor. 

We study the problem of dynamically and jointly opti- 
mizing the source coding and transmission strategies over 
time- varying channels and sources. The problem consists in 
the minimization of a cost function of the distortions in the 
source reconstructions at the sink under queue stability con- 
straints. Our approach is based on the Lyapunov optimization 

'This reference considers transmission of Web data. 



strategy with weight perturbation developed in [7]. We devise 
an efficient online algorithm that only takes actions based 
on the harvested energy, on the current state of channel, 
queues and energy reserves, and also based on the statistical 
description of the source correlation. We prove that the 
proposed policy achieves an average network cost that can be 
made arbitrarily close to the optimal one with a controllable 
trade-off between the sizes of the queues and batteries. 

We also investigate the role of side information available 
at the sink under the assumption that acquiring the side 
information entails an energy cost. It is shown that properly 
allocating the available (harvested) energy to both the tasks 
of transmission and side information measurement has signif- 
icant benefits both in terms of overall network cost function 
and queue sizes. 

B. Prior Work 

We start by introducing related prior work that assumes 
energy harvesting. The literature on this topic is quickly 
increasing in volume but it mostly (with the exception of [8]) 
accounts only for the energy consumption of the transmission 
component, and does not model the contribution of the source 
coding part. In this context, references [9] and [10] studied 
the problem of maximizing the throughput or minimizing 
the completion time for a single link energy-harvesting 
system by focusing on both offline and online policies (see 
also [11], [12]). A related work is also reference [13] that 
finds a power allocation policy that stabilizes the data queue 
whenever feasible. Still, for a point-to-point system, using 
large deviation tools, the effect of finite data queue length and 
battery size is studied in [14] in terms of scaling results as 
the battery and queue grow large. We now consider work on 
multihop energy-harvesting networks. As mentioned above, 
all the works at hand only account for the energy used for 
transmission. Moreover, source correlations and distributed 
source coding are not accounted for. In [7] assuming in- 
dependent and identically distributed (i.i.d.) channel states 
and energy harvesting processes, a Lyapunov optimization 
technique with weight perturbation [15] is leveraged to 
obtain approximately optimal strategies in terms of a general 
function of the data rates under queue stability constraints. 
The proposed technique obtains an explicit trade-off in terms 
of data queue length and battery size. An extension of this 
work that assumes more general arrival, channel state and 
recharge processes along with finite batteries and queues is 
put forth in [16]. Also related are [17], [18] and [19] that 
tackle similar problems. 

We now discuss work that accounts for the energy trade- 
offs related to source coding and transmission. These works 
(except [8]) do not model the additional constraints arising 
from energy harvesting. Moreover, they do not allow for 
distributed source coding. The joint design of source coding 
and transmission parameters is investigated through various 
algorithms, for either static scenarios in [20], [21] or dynamic 
scenarios in [22], [23]. Specifically, references [22] and [23] 




Figure 1. A set AT of energy-harvesting nodes communicate correlated 
sources to a destination d. For the more general model of Section VI, the 
destination d acts as a cluster head and communicates to a network collector 
node (shown in dashed lines). In this latter model, the node d can collect 
side information correlated to the sources measured by the nodes. 



studied the trade-offs between energy used for compression, 
or more generally source coding, and transmission by as- 
suming i.i.d. source and channel processes and arbitrarily 
large data buffer. Using Lyapunov optimization techniques, 
a policy with close-to-optimal power expenditure and an 
explicit trade-off with the delay is derived for a given average 
distortion. The problem of optimal energy allocation between 
source coding and transmission for a point-to-point system 
was studied in [8]. 

Finally, distributed source coding techniques for multihop 
sensor networks has been studied in [24] and [25]. In [24], 
the problem of optimizing the transmission and compression 
strategy was tackled under distortion constraints in a central- 
ized fashion, whereas [25] proposes a distributed algorithm 
that maximizes an aggregate utility measure defined in terms 
of the distortion levels of the sources. Both these works do 
not consider energy harvesting nor the energy consumption 
of the sensing process. 

C. Paper organization 

The rest of the paper is organized as follows. In Section II 
we present the system model and we state the optimization 
problem. In Section III we obtain a lower bound on the 
optimal network cost for the proposed problem. In Section IV 
we present the proposed algorithm designed following the 
Lyapunov optimization framework and we show how it can 
be implemented in a distributed fashion. Section V formalizes 
the main results of our paper and provide analytical insights 
into the performance of the proposed policy. Section VI 
proposes an extended version of the problem, where the 
sink node acts as a cluster head that is able to acquire 
correlated side information to improve the system perfor- 
mance. In Section VII we prove the effectiveness of our 
analytical analysis and discuss the impact of the optimization 
parameters. Section VIII concludes the paper. 

II. System model 

We consider a wireless network modeled by a direct graph 

Q = (AfU{d},£), where TV = {1,2,..., AT} is the set 



of nodes in the network, d is the destination (or sink), and 
C c {(n, m): n,m € AfU {d}, n 7^ m} represents the set of 
communication links, see Fig. 1 for an illustration. We define 
^max as the maximum number of transmission links that any 
node n <G N U {d} can have. As discussed below, we allow 
for fairly general interference models. We will consider a 
more general model in Section VI in which the sink acts 
as a cluster head for the set of nodes Af, and reports to a 
collector node c (see Fig. 1). 

A. Transmission Model 

The transmission model follows the framework of, 
e.g., [26]. According to this model, the network operates in 
slotted time and, at every time slot t = 1,2,..., each node 
n € Af allocates power P n , m (t) > to each outgoing link 
(n, m) <G C for data transmission. In what follows, we refer 
to the number of channel uses (or transmission symbols) per 
time slot as the baud rate multiplied by the slot duration. At 
the generic time slot t we define P(i) = {P n , m (t)}, with 
(n, m) e C, as the power allocation matrix and the total 
transmission power of node n, that is 



P n (t) = J2 P n,m(t), 



(1) 



m: (n.m)£C 



which is assumed to satisfy the constraint P n (t) < P max , 
for some P max < 00. The transmission rate /!„.,„(£) on 
link (n,m) depends on the power allocation matrix P(£) 
and on the current channel state S(i) — {S ntTn (t)} with 
(n, m) e C. The latter accounts, for instance, for the current 
fading channels or for the connectivity conditions on the 
network links. We assume that S(i) takes values in some 
finite set S = (si, S2, . . . , Sm), is constant within a time slot, 
but is independent and identically distributed (i.i.d.) across 
time slots. We use p s . = Pr [S(i) = sf\ for i = 1, . . . , M. 
We write 

/*n,m(t) = C n , m (P(t),S(t)), (2) 

where C n . m (P(t),S(t)) is the capacity-power curve for 
link (n, m) expressed in terms of bits per channel use 
(transmission symbol). The latter depends on the specific 
network transmission strategy, which includes the modulation 
and coding/decoding schemes used on all links. We assume 
that function C„. m (P(t),S(t)) is continuous in P(i) and 
non decreasing in P n , m (t). An example of the function 
C Il]m (P(i),S(i)) is the Shannon capacity obtained by treat- 
ing interference as noise at the receivers, namely 



C n , m (P(t),S(t))oclog 1 
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(3) 



where S n . m (t) represents the channel power gain on link 
(n, m) and Nq is the noise spectral density. We assume that 
there exists some finite constant /i m « such that n n ,m(t) < 
/Umax for all t, any power allocation vector P(£) and channel 



state S(t). Moreover, following [7], we assume that the 
function C n . m (P(t),S(£)) satisfies the following properties: 
Property 1: For any power allocation matrix P(£), we 
have: 

C n>m {P(t),S(t)) < SP n>m (t), (4) 

for some finite constant £ > 0; 

Property 2: For any power allocation matrix P(£) and 
matrix P'(i) obtained by P(i) by setting the entry P n ,m(t) 
to zero for a given (n, m) pair, we have: 



C , 6 (P(t),S(t))<C , 6 (P'(t),S(t)), 



(5) 



for all (a, b) e C, with (a, b) 7^ (n, m). 

Note that both properties are satisfied by typical choices of 
function C„ ;TO (P(£),S(£)) such as (3). In fact, Property 1 is 
satisfied if function C„. m (P(t), S(i)) is concave with respect 
to P n .m(t), while Property 2 states that interference due to 
power spent on other links cannot be beneficial. 2 Finally, we 
define the total outgoing transmission rate jU n ,*(i) from a 
node n € M at time t as 

A»n,*(*) = X! Atn,m(*)j (6) 

m: (n,m)(E£ 

and the total incoming transmission rate /u*, n (i) at a node 

n e N as 

H*,n(t) = ^2 Hm,n(t)- (7) 

B. Data Acquisition, Compression and Distortion Model 

At each time slot, each node of the network is able to 
sense the environment and to acquire spatially correlated 
measurements. The measurements are then routed through 
the network to be gathered by a sink node, as illustrated in 
Fig. 1. Before transmission, the acquired data is compressed 
via adaptive lossy source coding by leveraging the spatial 
correlation of the measurements. Specifically, we define the 
source state at time t as the spatial correlation matrix 
describing the signal within this time slot, which is referred 
to as O(t) = {O n . m (t)} with n,m € Af. We assume that 
O(i) takes values in some finite set O = {01,02, . . . ,Ol}, 
remains constant within a time slot, but is i.i.d. across time 
slots. Additionally, we define the mdf p . — Pr[0(£) = Oj]. 
Each node n € Af compresses the measured source with rate 
R n (t) < R max bits per source symbol and targets a repro- 
duction distortion at the sink of D mm < D n (t) < D max , 
with < i? max , Dmin < Dmax < CO- Note that imposing 
a strictly positive lower bound on D n (t) is without loss 
of generality because the rate R n (t) is upper bounded by 
a finite constant and therefore the distortion D n (t) cannot 
in general be made arbitrarily small (see, e.g., [5]). The 
distortion is measured according to some fidelity criterion 
such as mean square error (MSE). We define the rate vector 
as R(£) = (.Ri(£), . . . , i?jv(t)) and the distortion vector as 

2 This may not be the case if sophisticated physical layer techniques are 
used, such as successive interference cancelation (see, e.g., [5]). 
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Figure 2. Illustration of the rate region (8) for correlated sources and 
Af = {1, 2}. For all rate pairs (Ri(t), R2(t)), there exists acoding schemes 
that enables the sink to recover the two sources with distributed distortion 
(MSE) levels Di(t) and D2(t), respectively. 



D(£) = (D\(t), . . . ,Djv(i)). Due to the spatial correlation 
of the measurements, distributed source coding techniques 
can be leveraged. Thanks to these techniques, the rates of 
different users can be traded without affecting the achievable 
distortions, to an extent that depends on the amount of 
spatial correlation [5], The adoption of distributed source 
coding entails that, given certain distortion levels D(£), the 
rates R(i) can be selected arbitrarily as long as they satisfy 
appropriate joint constraints. Under such constraints, a sink 
receiving data at the specified rates is able to recover all 
sources at the given distortion levels. 

To elaborate on this point, consider the following condi- 
tions on the rates R n (t) and distortions D n (t) for n G Af: 

]T R„(t) > g(*,0(t))-log ( (27re)l*l [] A,(*) J , (8) 
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for all X C Af, where g(X,0(t)) denotes the joint con- 
ditional differential entropy of the sources measured by the 
nodes in the subset X, where conditioning is with respect to 
the sources measured by the nodes in the complement Af\X. 
For instance, for jointly Gaussian sources with zero mean and 
correlation matrix O(t), we have 



M)/wu 1, / detO(t) \ 



(9) 



where 0(£)|jv"\a? represents the correlation submatrix of the 
sources measured by nodes in Af \ X. If the rates satisfy 
conditions (8), it is known [27] that, for sufficiently small 
distortions and any well-behaved joint source distribution, 
the sink is able to recover all the sources within MSE levels 
D n (t) for all n G Af. We remark that this conclusion is also 



valid for any distortion tuple D(£) if the sources are jointly 
Gaussian. 

As an example, the rate region for Af = {1,2} is sketched 
in Fig. 2. The rates R\(t) and R^it) at which the two source 
sequences are acquired and compressed at the two nodes can 
be traded with one another without affecting the distortions 
of the reconstructions at the sink, as long as they remain in 
the shown rate region (8). 

We account for the cost of source acquisition and com- 
pression by defining a function P°(R n {t)) that provides the 
power spent for compressing the acquired data at a particular 
rate R n (t). For the sake of analytical tractability, we assume 
that each function P°(R n (t)) is 



P^(R n {t)) = a n R n {t), 



(10) 



for some coefficient a n > 0. Finally, we remark that the 
destination d is assumed not to have sensing capabilities, 
and thus is not able to acquire any measurements. We will 
treat the extension to this setting in Section VI. 

C. Energy Model 

Every node in the network is assumed to be powered 
via energy harvesting. The harvested energy is stored in an 
energy storage device, or battery, which is modeled as an 
energy queue, as in e.g., [7]. The energy queue size E n (t) 
at a node n G Af measures the amount of energy left in 
the battery of a node n at the beginning of time slot t. 
For convenience, we normalize the available energy to the 
number of channel uses (transmission symbols) per slot. 
Without loss of generality, we assume unitary slot duration 
so that the amount of power consumed for transmission and 
acquisition/compression is equivalent to the energy spent in 
a time slot. Therefore, at any time slot t, the overall energy 
used at a node n G Af must satisfy the availability constraint 



Pn + Pn(Rn(t))<E n (t). 



(ID 



That is, the total consumed energy due to transmission and 
acquisition/compression must not exceed the energy available 
at the node. 

We denote by H n (t) < i/ max the amount of energy 
harvestable by node n at time slot t, and we define the 
vector H(£) = [H\(t), . . . , H^(t)) as the energy-harvesting 
state. We assume that H(t) takes value in a finite set 
H = {hi,h2, . . . ,hp}, and is constant for the duration 
of a time slot, but i.i.d. over time slots. Finally, we define 
the probability ph ; = Pr[H(£) = hj], which accounts for 
possible spatial correlation of the harvestable energy across 
different nodes. 

The energy harvested at time t is assumed to be available 
for use at time £+1. Moreover, each node n G Af can decide 
how much of the harvestable energy H n (t) to store in the 
battery at time slot t, and we denote the harvesting decision 
by H n (t), with < Hjjf) < Hjt). We define the harvest- 
ing decision vector as H(£) = (H\(t), . . . , H^(t)). Variable 



H n (t) is introduced, following [7], to address the issue of 
assessing the needs of the system in terms of capacities of 
the energy storage devices. In fact, as in [7], we do not make 
any assumption about the battery maximum size. However, it 
will be proved later that performance arbitrarily close to the 
optimal attainable with no limitations on the battery capacity 
can be achieved with finite-capacity batteries. 

D. Queueing Dynamics 

We now detail the dynamics of the network queues. We 
define E(i) = {Ei(t), . . . ,E N (t)) to be the vector of the 
energy queue sizes of all nodes at time t. From the discussion 
above, for each node n G Af, E n (t) evolves as 

E n (t + 1) = E n (t) - P n (t) - P„ c (i?„(t)) + H n (t), (12) 

since at each time slot t, the energy P n (t) + P^(R n (t)) 
is consumed, while energy H n (t) is harvested. We assume 
E n (0) > for all n G Af. 

We also define the vector U(i) = (Ui(t), . . . , U N (t)), for 
each time slot t, to be the network data queue backlog, where 
U n (t) represents the amount of data queued at node n, which 
is normalized on the number of channel uses per time slot 
for convenience of notation, that is it is expressed in terms of 
bits over channel uses per slot. Denote as b the ratio between 
the number of channel uses per slot and the number of source 
samples per slot. Since b typically accounts for the ratio of the 
channel and source bandwidth, it is conventionally referred 
to as bandwidth ratio, [5]. We assume that each queue U n (t) 
evolves according to the following dynamics: 

U n (t + 1) < max {17 n (t) - /*„,*(*), 0} + /*«,„(<) + -^i, 

(13) 
since at any time slot t, each node n G Af can transmit, 
and thus remove from its data queue, at most /i n ,*(t) bits 
per channel use, while it can receive at most /U*, ra (£) bits 
per channel use due to transmissions from other nodes 
and R n (t)/b bits per channel use due to data acquisi- 
tion/compression. We assume that U n (0) = for all n G Af. 
Following standard definitions [28], we say that the network 
is stable if the following condition holds true: 



1 



T-l 



limsup -- V V E[U n {i)} < oo. 

4=0 n£./V 



(14) 



Notice that the network stability condition (14) implies that 
the data queue U n (t) of each node n E Af is stable in the 
sense that limsup ^ X^t=o ^[^™ W] < °°- 



E. Optimization Problem 

Define 9(£) = (S(i),0(i),H(t),U(i),E(i)) as the state 
of the network at time slot t. A (past-dependent) policy ir — 
{7r(t): t = 1, 2, . . .} is a collection of mappings between the 
past and current states {O(t): t — 1, . . . , t} and the current 
decision (R(i),D(i),H(i),P(i)) on rates R(£), distortion 



levels D(i), harvested energy H(£) and transmission powers 
P(t). Moreover, for each node n € Af, let i n (D n (t)) denote 
the cost incurred by node n when its corresponding distortion 
is D n (t). We assume that each function i n (D n (t)) is convex, 
finite and non-decreasing in the interval [D m [ n , Z? max ]. Our 
objective is to solve the following optimization problem: 



minimize F£ 



n£Af 



(15) 



where 



T-l 



El = limsup i V E[i n (D n (t))}, (16) 

subject to the rate-distortion constraints (8), the energy avail- 
ability constraint (11) and network stability constraint (14). 
Note that (16) is the per-slot average cost for node n. 

III. Lower bound 

In this section, we obtain a lower bound on the optimal 
network cost F * of problem (15). This result will be used 
in Section V to obtain analytical performance guarantees 
on our online optimization policy, presented in Section IV. 
The lower bound is expressed in terms of an optimization 
problem over parameters R( 0i ) = [R^ , . . . , i% ] and 
D (o.) = [£>(°*) ,...,£><£«)] f or all 0; e O, P^l with 
entries Pn,m for each (n, m) G L and for all Sj G S, and 



H (h fc ) = [H\ tlk) , . . . , H ( „ k) ] for all h k e U. The proof is 
based on relaxing the stability constraint (14) by imposing the 
necessary condition that the average arrival rate at each data 
queue be smaller than or equal to the average departure rate, 
and by also relaxing the energy availability constraint (1 1) by 
requiring it to be satisfied only on average. Finally, Lagrange 
relaxation is used on the resulting problem. The details of the 
proof are available in Appendix A. 

Theorem 3.1: The optimal network cost Fq satisfies the 
following inequality: 



for all A e 



VF*>d(\,v, X ) 



(17) 



v G R^,x G K . where d(X,v,x) 



is given by 

OiEO SjG5 \i k eU 

(18) 

with d 0i _ s ^ k (X^ 0i \v,x) defined in (19), where the infi- 
mum is taken under constraints: 

f) < #( 0< ) < 7? D ■ < n( 0i ) < D 

for all n G TV, Oi G O, 

< P^ ] < P max , for all n G Af, Sj G S, (21) 

and < H^ k) < h fe ,„, for all n G A/", h fc G H. (22) 

Proof: See Appendix A. ■ 
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IV. Proposed Policy 

In this section, we propose an algorithm designed fol- 
lowing the Lyapunov optimization framework, as developed 
in [26] [28], to solve the optimization problem (15). In par- 
ticular, we aim at finding an online policy it for problem (15) 
with close-to-optimal performance, by using Lyapunov opti- 
mization with weight perturbation. The technique of weight 
perturbation, as proposed in [7], is used to ensure that the 
energy queues are kept close to a target value. This is done 
to avoid battery underflow in a way that is reminiscent of 
the battery management strategies put forth in [14], and is 
further discussed below. 

The proposed policy operates by approximately minimiz- 
ing at each time slot the one-slot conditional Lyapunov 
drift plus penalty [28] of the energy and data queues ((12) 
and (13), respectively) of the network. The optimization 
is done in an on-line fashion based on the knowledge of 
the current channel state S(£), observation state O(t), data 
queue sizes U(t) and energy queue sizes E(£). Note that 
no knowledge of the statistics of the states is required, as 
it is standard with Lyapunov optimization techniques [26], 
[28]. Using this approach, we obtained the following online 
optimization algorithm. 

Algorithm: Fix a weight = [$\,. . . ,0n] € R+ and 
a parameter V > 0. At each time slot t, based on the 
values of the queues E(£) and U(t), channel states S(t) and 
observation states 0(£), perform the following: 

• Energy Harvesting: For each node n e A/*, choose 
H n (t) that minimizes (E n {t) — 6 n )H n (t) under the con- 
straint < H n (t) < H n (i). That is, if (E n (t) - 6„) < 
0, perform energy harvesting and store the harvested 
energy, i.e., set H n (t) = mm{8 n - E n (t),H n {t)}\ 
otherwise, perform no harvesting, i.e., set H n (t) = 0; 

• Rate-Distortion Optimization: Choose the source acqui- 
sition/compression rate vector R(£) = r = [n, . . . , r/v] 
and the distortion levels D(£) = d = \d\, . . . , d^\ to 
be an optimal solution of the following optimization 
problem: 



Power Allocation: Define the weight of a link (n, m) as 



minimize 

r,d 



E [Un{t)r n - (E n (t) - 6 n )pc(r n ) 
+Vi n (d n )}, 



(23) 



subject to the rate-distortion region constraint (8), and 
to the constraints < r n < i? max and D m [ n < d n < 
fl max , for all n e M; 



W n , m (t) = max{ [/„(£) - U m {t) - 5, 0}, 



(24) 



where 8 = / m a X Mma X +-Rma X , and choose P(£) = p with 
entries p n ^ m for (n, m) <G C to be an optimal solution 
of the following optimization problem: 



maximize 
p 



E [ E C„, m (p,S(i))W„, m (£) 

neN meAf\n (25) 

+(E n (t)-e n )p n ], 



where p n = J2 m eJV\nP",m, subject to constraints < 
Pn < -Fmax, for each n G N; 
• Queues Update: Update E(i) and U(i) according 
to (12) and (13), respectively. 

Remark 4.1: In the algorithm proposed above, the energy 
availability constraint (11) is not explicitly imposed. How- 
ever, as discussed in Section V, with a proper choice of 
the weight vector 6, the battery levels are guaranteed to be 
such that condition (11) is never violated. In other words, 
the effect of the weight vector is to ensure that, whenever 
the algorithm requires to draw energy from the batteries 
for transmission or acquisition/compression, there is energy 
available at the corresponding nodes to satisfy the request. 

A. Price-based Distributed Optimization 

While the Energy Harvesting step can be performed in- 
dependently by all nodes, the Rate-Distortion Optimization 
problem (23) and the Power Allocation problem (25) require 
centralized optimization. Decentralized implementations of 
the Power Allocation problem (25) are discussed in many 
papers, see, e.g., [29]. Here we discuss how to (approxi- 
mately) solve the Rate-Distortion Optimization problem (23) 
in a distributed fashion via dual decomposition [30] [31]. To 
this end, we introduce the Lagrange multipliers A <G M + 
for the 2^ — 1 coupling constraints (8), thus obtaining the 
Lagrangian function for problem (23): 



£(r,d,A) 



neAf 



[U n (t)r n - {E n {t) - 6 n )P°{r n ) 



vf n (d n )} + j2^ 



s(x m ,o(t)) 



log (27re)l*-l J] d i) ~ E r ' 
V iex m ) iex m 



, (26) 



where the second sum runs over all the 2 N — 1 subsets X m 
of Af. We will use the notation X m for the subsets of Af 
throughout the rest of the paper. Moreover, the dual function 
for problem (23) is 

G(A)=inf£(r,d,A), (27) 

r.d 

with constraints < r n < i? max and D min < d n < D max 
and the Lagrange dual problem is given by 

maximize G(A). (28) 

Following the dual decomposition approach [30] [31], the 
problem of calculating the dual function (27) for a given 
multiplier vector A can be decomposed into N local opti- 
mization subproblems, one for each node n e Af. Moreover, 
solution of the dual problem (28) can be performed in an 
iterative fashion using the subgradient method [30], as it is 
standard practice [30] [31]. This leads to the following price- 
based distributed iterative solution of the dual problem (28) 
for time slot t: 

Initialize A(l) >: 0. Then, for each iteration r = 1, 2, . . .: 
• For the given A(r) — A, each source node n solves the 
local optimization problem 

minimize U n (t)r n - (E n (t) - 9 n )P^{r n ) 

0<r rl <fi max , -D min <d„<D max 



+ Vi n (d n ) - (log(d n ) + r n ) J2 X r, 



(29) 



m: ntLXry 



obtaining the optimal values (r*(A),d* (A)); 

The dual variables A are updated using the subgradient 

method [30, Section 6.1] as 



A(r + l)=A(r)+e r a(A(r)), 



(30) 



where e T is a positive scalar step size and a (A) = 
E m g(^,0(i))-log(27re)l^l-E neJ vlogK(A)) + 
r* (A) is a subgradient of function G(A). 

With various choices for the weights e T (e.g., e T = 1/r), 
due to the concavity of function G(A), the procedure above 
is guaranteed to converge to the optimal value of the dual 
problem (28) [30, Section 3.4]. Moreover, under the given 
assumptions, problem (23) is convex and satisfies Slater's 
condition [32]. Therefore, strong duality holds, which guaran- 
tees that the optimal value of the dual problem (28) coincides 
with the optimal value of (23), and the optimal value of (28) 
is attained at some value A*. However, in order for the 
illustrated iterative procedure to converge to an optimal 
solution (r*,d*) of problem (23), we need that the value 
of the pair 3 (r, d) at which the infimum in (27) is attained 
for A — A* coincides with the optimal pair for the original 
problem (23). This can be guaranteed if the Lagrangian 
function £(r, d, A) is strictly convex in (r, d) [30, Section 
3.4]. As proposed in [33] this can be enforced by adding a 
small term e(||r|| 2 + ||d|| 2 ) to £(r, d, A) while performing the 

3 This pair exists in virtue of the Weierstrass theorem [30]. 



minimization (27), foe some e > 0. Although this operation 
is bound to make the solution only approximate, the quality 
of the approximation can be controlled by keeping e small. 

V. Performance Analysis 

In this section, we provide analytical insights into the 
performance of the policy proposed in Section IV. To this 
end, we define the parameters (3 n — min {a n , 1} (recall (10)) 



UdrQ-UAnax) 



log(d„/I5 max ) 



which is 



and 7 „ = sup r , mm < d7i < r , n 
finite under the given assumptions. 

Theorem 5.1: Under the proposed algorithm with 9 = 
[01, ... , 0jv], where 9 n = ^V + a„i? max + P max , we have: 

1) The data queue and the energy queue of all nodes are 
bounded as: 

0< E n (t) <6 n , (31) 

and0< U n {t) < 7 „U + i? max , (32) 

respectively, for all nodes n e Af and all times t; 

2) When a node n € Af allocates a non-zero power to 
any of its outgoing links (i.e., P n (t) > 0), and/or 
when it chooses a non-zero source acquisition rate 
(i.e., R n (t) > 0), thus expending energy for source 
acquisition/compression, we have that: 



E„(t) > a„R n 



P,r 



(33) 



This condition guarantees that the energy availability 
constraint (11) is satisfied for all nodes n € Af and all 
times t (see Remark 4.1 and Remark 5.2). 
3) The overall cost Fq (15) achieved by the proposed 
scheme satisfies the bound 



K ^J2 F n< F 
nGAf 



B 



(34) 



where 



o 



cost of prob- 
constant B is 

-'<-ma X / ^ I T" 



is the optimal 
lem (15) and the finite 

£> J* y/^max(Mmax ' ^masj i -"-max/ 

iv/2(/j max + a„rt max + -r max + 2a„it max .r max ) + 

A'X'WmaxAWx + #max/ 4 )- 

Proof: See Appendix C. ■ 

Remark 5.2: The fact that (33) implies that the proposed 
algorithm satisfies the energy availability constraint (11) at 
each time slot follows since each node n E Af cannot 
consume an energy larger that a„-R ma x + -Pmax in a time 
slot. In fact, Qf n i?max is the maximum energy spent for 
compressing the acquired data and P max is the maximum 
transmission energy consumption. 

Remark 5.3: Following [16], under the modified stability 
requirement limsup ^2~2t=i U n {t) < oo, for all n <G Af, 

T^oo 

the proposed algorithm can be proven to guarantee near- 
optimal performance with probability one. 



VI. Extension with side information at the sink 

We now consider an extended version of the problem stud- 
ied thus far, in which the sink node d, rather than being the 
final destination for the sources measured at the sensors, acts 
as a cluster head and communicates to a network collector 
node c (see Fig. 1), on a communication link modeled as 
for any other pair of node (see Section II-A). The key novel 
aspect of this extended model is that node d can measure a 
source correlated with that of the sensors and use such side 
information to improve the system performance. Specifically, 
thanks to the side information available at node d, the rate 
requirements for communication from the sensors to d can 
be reduced. However, node d, which is powered by energy- 
harvesting as all the sensors, also needs to communicate with 
node c. Therefore, a new trade-off arises between the energy 
allocated by d to acquire side information and that used by 
d to communicate with c. 

We now discuss how the model discussed in Section II 
needs to be modified in order to account for the different 
setting of interest here. First, the destination d acquires a 
source signal which is correlated with the sensor's mea- 
sures with a rate Rd(t). This affects the rate-distortion 
constraints (8) in that the entropy function g(X, 0(t)) should 
now be conditioned on the side information available at 
the receiver (see, e.g., [34]). This leads to modified rate- 
distortion constraints (8) with a function g(X,0(t), Rd(t)) 
that also depends on R d (t). An example of this function will 
be given in Section VII. The energy used for acquiring the 
side information is given by P^(R d (t)) = adRd(t) and the 
slot duration similar to all other nodes. Moreover, the data 
queue at node d evolves as 

U d (t + 1) < max{L/ d (t) - /u d , c (i),0} + /*,,,,(*), (35) 

where Hd,c(t) and /i*.d(t) represent, respectively, the trans- 
mitted and received data at time t, and transmission is to the 
collector node c. Note that no other node is connected to the 
network collector c apart from d. The energy queue Ed(t), 
instead, evolves according to (12). Finally, P(t) and S(t) 
are extended to consider the additional link (d, c) <G C and 
the rate achievable on that link is given by C d , c (P(t),S(t)), 
which is assumed to have the same properties as for all 
other links (see Section II). We refer to the power used for 
transmission by node d as P d . 

In what follow we modify the algorithm proposed in 
Section IV in order to address the new setting outlined above. 
The modified algorithm works as follows: 

• Energy Harvesting: Follow the same procedure as for 
the algorithm discussed in Section IV, for all nodes 
including node d\ 

• Rate-Distortion Allocation: Choose R n (t) and D n (t), 
n = 1, . . . , N, and Rd(t) to be the optimal solution of 



the following optimization problem: 

minimize V [U n (t)r n - (E n (t) - O n )P^(r n ) 
(r ' d) ' rd neM (36) 

+ Vi n (d n )] + (E d (t)-6 d )P c d {r d ) 1 



subject to ^2 



n£X 'n 



> 



g(*,0(t),r d ) 



log((27re)l*lrU*0'V* C Af, < r n < R max 
and D min < d„ < D max , n e Af and < r d < R max ; 
« Power Allocation: Define the weight of a link (n, m) <G 
C as 4 (24) and choose P(£) = p with entries p n , m for 
(n, m) <G C to be an optimal solution of the following 
optimization problem: 

maximize J^ I J^ C„ ;m (p,S(t))W„ ;m (i) 

nEAf raEAf\n 

+ (E n (t) - e n ) Pn ] + c d , c (p,s(t))Wd, c (t) 

+ (E d (t) - d )p d , 

subject to < p n < P max , for each n € Af U {d}. 

* Queues Update: Update E(t) and E d (t) according 
to (12), U(t) according to (13) and U d (t) using (35). 

The algorithm proposed above is a simple modification of 
the algorithm proposed in Section IV that accounts for the 
need to allocate rate and power also for node d. It can be 
proven that this algorithm has similar optimality properties as 
the algorithm of Section IV as summarized in Theorem 5.1. 
We omit a formal statement of this result here, since it is a 
straightforward extension of Theorem 5.1. 

VII. Numerical Results 

In this Section, we provide further insights into the per- 
formance of the system under study, via some numerical 
results. We consider the network topology of Fig. 1, where 
the set Af of nodes gathers spatially correlated data and 
transmits it to the sink node d. We first consider the set-up 
without side information at the sink described in Section II. 
We assume that nodes {1,2,3} collect the measurements, 
while nodes {4, 5} are only used as relays (or equivalently 
measure zero-power sources). The signal samples measured 
at nodes {1,2,3} are jointly Gaussian with zero mean and 
time-independent correlation matrix 



0(i) = 



1 


U) 


U) 


U! 


1 


w 


id 


U) 


1 



(38) 



where to e [ — 1, 1] is the spatial correlation coefficient. 
The channel state matrix S(t) has independent entries that 
are Rayleigh distributed, while the energy harvesting vector 
H(i) has independent entries that are uniformly distributed 
in [0,_ff max ]. Both channel and energy harvesting statistics 
are i.i.d. across time slots. 

For the channel capacity function, we consider 



C„, m (P(i),S(i)) 



log(l + P n ,m{t)S n , m {t)) for all 



4 We remind that C is extended to consider the link (d, c). 
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Figure 3. Fq vs maximum and average network queue size for 1 < V < Figure 4. F^ vs maximum and average network queue size for different 
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(n, m) E C, while the entropy function is given by (9) 
and the cost function is i n (D n (t)) = D n (t) for all 
n G AT. Moreover, we set the numerical values a n = 1, 
-ffmax = 3, £> min = 0.001 and F max = a n R m ^, with 
i?max = g({l,2,3},0(t)) - log((2^eA„i„) 3 ). In what 
follows, we refer to network queue size as the sum of the 
queue sizes of all nodes in AT. 

We first examine the effect of parameter V, which was 
shown in Theorem 5.1 to characterize the (V, 1/V) trade- 
off between the network queue size and the additive gap 
with respect to the lower bound of Theorem 3.1. To this 
end, in Fig. 3, we set u = 0.5 and plot the average sum- 
distortion Fq as a function of the maximum and average 
network queue size for different value of the parameter V. 
Confirming the results of Theorem 5.1, we observe that the 
sum-distortion F£ gradually converges to the lower bound set 
by the optimal value F * for increasing V. A closer inspection 
of the results also reveals an almost linear increase of the 
maximum and time average network queue size with respect 
to V, as suggested by Theorem 5.1 (not shown). 

Next, we evaluate the impact of the spatial correlation 
parameter oj. As discussed, an increasing ui is expected to 
lead to a reduction in the energy consumption for the same 
reconstruction accuracy at the sink thanks to the spatial 
energy trade-offs enabled by distributed source coding. This 
is confirmed by the results in Fig. 4, where we plot the 
sum-distortion Fq versus the average and maximum network 
queue size, where each point is obtained for a different value 
of the correlation ui in [0, 1). We note that an increasing oo 
leads to a reduction of both the network queue size and Fq, 
Note that the performance with u = corresponds to that 
of a conventional source coding system (i.e., not leverag- 
ing distributed source coding) as, in this case, distributed 
source coding does not offer any advantage and reduces 
to conventional compression. Thus, comparison between the 
performance with ui = and cj > reveals the gain of 
leveraging distributed source coding. Note that this gain is 



quite substantial, leading in the best case {uj — > 1) to a 
decrease of a factor 3 in terms of distortion and of a factor 
2.3 in terms of queue size at the nodes. 

Finally, we evaluate the performance in the scenario of 
Section VI, where the sink node d acts as a cluster head, 
that measures a source correlated with that of the remaining 
sensors and communicates the gathered data to node c 
(see Fig. 1). To this end, we replace the entropy function 
g(X,0{t)) with a function g(X,0(t) 1 R d {t)), that takes into 
account the side information obtained by d with rate Rd{t)- 
We recall that Rd(t) is a decision variable of the new prob- 
lem, see (36). Following [34], the function g(X, 0(t), R d (t)) 
is given by (9) where the correlation matrix (38) should 
now be conditioned on the side information available at 
the destination [34]. According to the simple source model 
described in Appendix F, we assume that this conditional 
covariance matrix is given by 



Q(t) = 



1 - LOUJ d (t) 

w(l -u>d(t)) 

w(l - 0J d (t)) 



w(l -LO d (t)) 
1 - LOU d (t) 

w(l - 0J d (t)) 



w(l -u d {t)) 

w(l - Ud(t)) 
1 - UU) d (t) 



(39) 

where uj d (t) = \ — 2~ Rdi < t \ We consider the same simulation 
parameters as above and we additionally set a d — 1 and, only 
for node d, i? max = 12. 

Fig. 5 shows the sum-distortion Fq and the average 
network queue size versus ui e [0,1). As a reference, we 
compare the performance of the proposed algorithm with that 
of a scheme that sets R d (t) = 0. This scheme, therefore, does 
not acquire side information at the sink and instead utilizes 
all the available energy at the sink for transmission to node 
c. It can be seen that gains in terms of memory and distortion 
can be obtained by properly allocating the available energy 
between the tasks of transmission and source coding at the 
sink node, e.g., a reduction of more than 25% and 21% is 
obtained for ui > 0.9 for the queue size and Ffi, respectively. 
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Figure 5. Fg and average network queue size vs source correlation ui. 
(V = 1000) 



VIII. Conclusions 

Energy harvesting poses new challenges in terms of energy 
management of wireless networks. In sensor networks, these 
challenges are compounded by the need for balancing the 
energy consumed by source coding tasks (i.e., data com- 
pression) against that used for transmission. Moreover, the 
correlation among the data readings collected by different 
sensors, if leveraged via distributed source coding, makes 
it possible to exploit spatial energy trade-offs across the 
sensors, thus allowing for better performance in terms of 
memory usage and distortion at the sink. Based on the 
above, this work has proposed a dynamic online optimization 
strategy for multihop wireless sensor networks with energy 
harvesting capabilities. This strategy jointly optimizes source 
coding and data transmission activities for time-varying 
sources and channels, by ensuring queue stability at the 
nodes and energy neutrality. The proposed technique, based 
on Lyapunov optimization, has been analytically shown to 
be characterized by a (V, 1/V) trade-off, revealing a linear 
relationship for queue and battery size (V) and an inverse 
proportionality in terms of optimality gap (1/V), where V 
is a tunable parameter of the algorithm. Numerical results 
have demonstrated the key role of source correlation and 
distributed source coding in the system performance. 
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Appendix A 
Proof of Theorem 3.1 

Proof: Define as <j>* the optimal value of the following 
problem: 



A' 



minimize 



^EE^E< M^) < 4 °) 

ne/JoieO k=l 

subject to: 

g(^o0-log(27re)l*in^i?[fc] 
< ^R { $ k] M&\XCN,Oi€0,k€ [!,...,#], (41) 



nex 



R M 



E *. E^^f 1 + E * E ^W(pL"U) 

H eo fc=i Sies fc=i 

K 

< E A E ei S<) /V*(Pi S<) ,s^fbr all n G AT, (42) 



Si es fe=i 

K 



A 



E * E £ i] (Pit) + E p 0l E ^ p n «ti 



Si es fe=i 



OiGO fc=l 



A' 



E^E^ h,) <w> fora11 ^-^ 



(43) 



hieH fe=i 



o<^U Si U hi) <i, 



for all Oj G O, Si G S, h t G U, k G [1, . . . , K], 



A 



A 



A 



E^-^E^-^E^ 1 ^ 1 ' 
fe=i fc=i fc=i 

for all Oi G O, sj G S, hi g W, 

U S: rt n [ji <: A max , i/min S "„[U S: "mHi 



for all n €J\f,Oi G (D,k G [1,...,K], 
< P^ < Pmax, for all n G TV, s, G 5, k G [1, . . . , #], 
< H^( k ) < h 4 ,„,for all n G 7V,hi G ft, fc G [1, . . . ,K], 



where the minimization is taken over variables $° , Q k , 



^\ P (0i) 



Z9 (o ;> 



if 



(hi) 



, . f ./ ,,•■" ,1 an d P ru for all n € J\f, 

[ky ri,[ky n,[k\ n ,[k\ 

o l G 0, s,j G 5, hi G ft and fc G [l,...,ftl with 

X = 2^ + 2. Variables {P^ } and {-D^ | can 
be interpreted, respectively, as the set of rates and distortions 
selected by node n G N when the source state is 0(t) = Oi. 
Specifically, node n selects rate R,L and distortion D:°* 



,[fe] m,u ulaL,J1Llu " ^ n ,[fe] 

with probability i?r*' when the source state is O(i) = Oi. 
Variables P ,,, > can be seen as the transmission 

I n,m,[fc]J fc=1 

powers allocated to link (n, m) G £, when the channel state 
S(i) = Si. Each power P„ m r fel is selected with probability 

g k s if S(£) = Si. Finally, variables < H n ?X > represent 
the harvested energy when the energy harvesting state is 
H(t) = hi = [hi,i,. ■ . , hi t N]. Each energy H^ k \ is selected 



„(h< 



with probability ip k if H(t) = hi. Note that we added the 
constant V in the optimization function for our later analysis. 
Theorem A.l: The optimal network cost PJ satisfies the 
following inequality: 



VP * > 



(44) 



where <ft* is the optimal value of the optimization prob- 
lem (40). The proof of Theorem A.l is in Appendix B. 

A generally looser lower bound can be evaluated by the 
weak duality in Lagrange optimization theory [32], which 
is easily seen to lead to Theorem 3.1. In fact, in (18), the 
parameters Am for m — [1, . . . , 2 N — 1] and Oi G O 
are the L(2 N — 1) Lagrange multipliers corresponding to 
constraints (41), parameters v n for n — [1,...,N] are 
the Lagrange multipliers corresponding to constraints (42) 
and parameters \n for n — [1,...,N], are the Lagrange 
multipliers corresponding to constraints (43). ■ 



Appendix B 
Proof of Theorem A. 1 

Proof: We follow an argument similar to the one used 
in [15]. Consider any stable policy n, i.e., a policy such that 
the condition (14) is satisfied under it. Since E[/i*„(t) + 
R n (t)/b - fi rh ,(t)} < (N - l) Mmax + P max /6, from [28, 
Theorem 2.8], constraint (14) implies the mean rate stability 
constraint and thus the condition 



T-l 



limsup — \] E 



t=o 



l^*,n[f) 



Rn(t) 



< 



T-l 



(45) 



lirninf I^E[ M „,4i)] 



t=o 



for each node n G J\f. We thus relax problem (40) by 
substituting (14) with (45). We further relax the energy 
availability constraint (11), imposing average stability for the 



energy queues (see (12)) 



T-l 



limsup 1 J2 nPn(t)+P c n {R n {t))] 



T-l 



(46) 



limsup iVE[ff n (t)] 



t=0 



For the relaxed problem, we can show as in [15] that the 
optimal policy is stationary and depends only on the source 
and channel state. From this, by Caratheodory's theorem [35], 
we obtain that the problem at hand is equivalent to (40). ■ 

Appendix C 
Proof of Theorem 5 . 1 

Proof: 
1) From the energy harvesting part of the algorithm, we 
have that E n (t) < 9 n , since harvesting is performed only 
when E n (t) < 9 n and the maximum amount of harvested 
energy in that case is 8 n — E n (t). This proves (31). We 
now prove (32) by induction on t. Inequality (32) holds for 
t = 0, since U n (0) = for all n. Then, assuming that (32) 
is satisfied for all n at time t, we show that it holds also for 
time t + 1. To this end, we consider separately the different 
possible cases in which a node n receives or not data from 
other nodes (i.e., endogenous data) and/or acquires or not its 
measurement (i.e., exogenous data). First, if node n receives 
neither endogenous nor exogenous data, then we have that 
U n (t + 1) < U n (t) < 7„y + -Rmax, which proves the claim. 
Second, assume that node n € Af receives endogenous, but 
not exogenous, data. It follows from (25) that, for some node 
m € TV, with m / n, we must have 



U n (t) <U m (t)-5< 7 „V + i? max - 5. 



(47) 



However, since any node can receive at most /maxMmax bits 
per channel use of endogenous data, we have from (47) and 
the definition of S that U n (t + 1) < j n V < "f n V + R max , 
which proves the claim. 

We now analyze the case where node n receives exoge- 
nous, but not endogenous, data. This implies that r n > is 
obtained from the solution of problem (23). We define the 
corresponding Lagrangian function as 



£(r,d,A,u) 



nGTV 



[U n {t)r n - (E n (t) - d„)PZ{r n ) + Vi n {dn)} 



.^A ro [g(* m ,0(i))-log((27re)l*-l J[ d, 



E 

iex m 






(48) 



where we have relaxed the constraints (8) and constraints 
dn < Bmax- The Lagrange dual function is given by 



G(X,v) =miZ(r,d,X,v) 

r.d 



(49) 



where the infimum is taken with the constraints < r n < 
JJ max and d n > 0, and the dual problem is given by: 



maximize G(A, v). 



(50) 



Lemma CI: Any dual optimal vector A* (i.e., a vector A 
maximizing (50)) satisfies the conditions 



E y m <lnV, 



(51) 



m: n^Xrr 



for all n € Af. Moreover, any primal optimal r* satisfies the 
condition 



r* = argmin U n {t)r n - (E n (t) - 9 n )P^(r n ) 

0<r„<_R max 



E K 



(52) 



m: n^Xjr 



The proof of Lemma C.l can be found in Appendix D. 

According to (52) we have that r* > is an optimal 
solution of problem (23) only if the value of the right-hand 
side of (52) evaluated at r n = is larger than the value 
obtained by evaluating it at r*, which can be expressed, 
using (10), as 

U n (t)r* n + (6 n - E n (t))a n r* n - r* n ]T A^ < 0. (53) 

m: nSiXm 

From (31), (51) and (53), we further obtain: 

U n (t)< Yl K,<lnV, 



(54) 



which implies that a node n receives exogenous data from 
outside the network only when U n (t) < "fnV. Hence, 
recalling that R n (t) < i? max , we obtain the desired result 
U n {t + 1) < j n V + R max . 

Finally, if a node n receives both endogenous and exoge- 
nous data, we have from (47) that U n (t) < jnV ~ l max iJ, maK . 
Since a node n can receive at most Z max Mmax bits per channel 
use of endogenous data and i? max bits per channel use of 
exogenous data, we have the desired inequality U n (t+ 1) < 
7„y + i? max , which completes the proof of part 1). 
2) To prove the claim, we need to show that if 



E n (t) < a n R max + P n 



(55) 



then the following two conditions must be satisfied: 

a) the Rate-Distortion problem (23) is minimized by choos- 
ing R n (t) = r n = (which implies P^(t) = 0) for all 
n e Af; 

b) the Power Allocation problem (25) selects a power matrix 
P(t) such that P n (t) = for all n e Af. 

From Lemma C.l, and in particular from (52), condition a) 
is verified if 



U n (t)r n - (E n - e n )P^{r n ) 
— r 



J2 x * m > 0, for all r n > 0, ( 56 ) 



where we recall that A* is any optimal dual vector of 
problem (50). This is proved by the following inequalities: 



U n {t)r n - (E„ 



J n i^n ' n 



m: tiG^t 



a: 



> U n (t)r n + -^-Va n r n - r n ^ Kn 

' n m: neX m 

> U n {t)r n + -^-Va n r n - r n j n V 

Pn 

= U n (t)r n + ~f n V > 

Pn 

where the first inequality follows from (55) and the assump- 
tion of Theorem 5.1 that 9 n = ttV + a n R 



/3„ 



*n J l max 



P, 



the second from (51); and the last inequality follows from 
U n (t) > 0, r n > and from the definition of /3„. This 
proves (56) and thus that condition a) is satisfied if (55) holds. 
To prove b) we first note that the bound (32) implies that 
the weight (24) satisfies the inequality 

W n , m (t) = max{l/ n (t) - U m (t) - 6,0} 

for all (n, m) E C and for all time t. We now show by 
contradiction that condition b) holds when (55) is satisfied. 
To this end, assume that the power allocation vector P* that 
maximizes (25) at time t is such that some entry P* m is 
positive. Starting from P*, we now obtain a new power 
allocation vector P, in which we set P„, m = 0. Clearly, 
the power matrix P is also feasible. We demonstrate that the 
objective function of (25) when evaluated at P* is smaller 
than at P, thus leading to a contradiction. Denoting as G(P) 
the objective function of (25), this is shown by the following 
inequalities: 

G(P*)-G(P) = 
= E E [C n ,i(P%S(t))-C„,,(P,S(t))]W n ,,(t) 

nGAf ZeA/"\n 

+ (E n (t) - 6 n )P* im 
< C„, ro (P*,S(t))W„, ro (i) + (£?„(*)- e n )p* >m 

< C„, m (P* ,S(t))(lnV - Z max /w) + (£„(*) - n )K,m 

< (inV - (maxjtJmax)^,™ + (E n (t) - 6 n )P^ m 

T 

< \1n,V ~ *maxA t max)4-'ri,m — ~7T~ n.m ^ ", 

Pn 

where the first inequality derives from p, n j(P*,S(t)) — 
/U„ ; ;(P,S(i)) < for all I ^ m {Property 2), the second 
from (57), the third from Property 1 and the fourth from (55). 
This shows that P* is not optimal for (25), thus leading to 
a contradiction, which completes the proof of 2). 
3) The proof of 3) is a relatively simple application of 
the general theory of [26] [28]. The details are provided 
in the following for completeness. We first define the stan- 
dard one-slot conditional Lyapunov Drift-plus penalty of the 
queues E(i) and U(£). To this end, we define Z n (t) — 
(U n (i),E n (i) — n ) and the corresponding vector Z(£) = 



(U(t),E(£) - 6). Following the standard definition [28], the 
quadratic perturbed Lyapunov function is given by 

1 N 
L(Z(t)) = _^||Z ri (i)|| 2 

71=1 

N N 

= o£(E>n(i)) 2 + 5£(£ n (t)-0n) 2 



n— 1 n— 1 

= L(U(i)) + L(E(i)-0), 



(58) 



while the one-slot conditional Lyapunov drift A(Z(i)) is 

A(Z(t)) = E [L(Z(t + 1)) - L(Z(t))\Z(t)} . (59) 

The proof of the following lemma can be found in Ap- 
pendix E. 

Lemma C.2: Under any feasible policy for problem (15) 
we have the inequality 

A(Z(t)) < B + J2 U n (t)E[-^4t) + n*, n {t) 

neAf 

+ R n (t)\Z(t)} + J2 ( E n(t) - 0n)E[ - Pn(Rn(t)) 
neAf 



-P n (t)+H(t)\Z(t) , 



= N(, 



R„ 



(60) 



-Rmax/ 2 ) + 



<)• 



With B — ii ^pmaxVPmax "r -"-max,/ 

iV / 2{H max + a n^ai&yi ' "max • ™n-Kmax"max; 

The proposed policy is based on the minimization 
of the drift-plus-penalty function [26] [28] A(Z(i)) + 
VE ^2 ne j^f i n (D n (t)) Z(t) . This amounts to finding a 
policy that minimizes the right-hand side of (61) (where 
the inequality follows from (60)). Minimization of (61) is 
done with respect to (R(i),D(i),H(i),P(i)) for the given 
(S(£), O(i), H(i), U(i), E(i)) under the constraints (8) and 



< R n < R n 



D n 



< D n < D„ 



as per definition 



of policy in Section II-E. It is now not difficult to see that, 
similar to [7], by Lagrangian relaxation of the constraints (8), 
the dual function of the said minimization problem, when 
considering fixed O(i) = o^, S(i) = Sj, H(t) = h^ and 
fixed queue lengths (U(i),E(t)), is given by d(A( 0i )) = 
do„s 3 ,h fc (A( *\U(t),E(t) - 6) as defined in (19). Note that 
the Lagrange multipliers A^ ^ are associated to the con- 
straints (8). Moreover, by convexity and Slater's conditions, 
we have that strong duality holds, and thus the minimum 
of (61) equals d(A( ^) for a given value A<°*) = \( 0i >. 

From the discussion above, the minimum of the right-hand 
side of the bound (61) equals 

B + E[d Oi , s ,, hfc (A^*,U(t),E(t)-0)|Z(i)] = 

= B + d(A*,U(i),E(£)-0) (62) 



for some A* € 



p L(2~-l) 



(A* collects all X^*). But by 



Theorem 3.1, we have that 



FF o *>d(A*,U(i),E(i)-0). 



(63) 



A(Z(i)) + VE 



E in(D n (t)) Z(i) 



%eAf 



<B+J2 ( E n(t) - 0n)l \H n (t)\Z(t) 



%eN 



+E 



-E 



E (U n (t)R n (t) - (E n (t) - e n )P c n {R n (t)) + Vi n {D n (t))) Z(t) 



.tieat 



X! ( 2 C n , ro (P(t),S(t))(J7 n (t)-l7 m (t)) + (f; n (t)-« n )P n (t) 

. n€vV \m: (n,m)e£ y 



Z(i) 



(61) 



From (61), we can now write that for the considered policy 
that minimize (61), we have the inequality 



A(Z(t))+VI 



ra£Af 



in(Dn(t)) 



Z(i) 



<B + VF£. (64) 



Moreover, taking expectation over Z(i) and summing the 
above over t = 0, .... T — 1, we have: 



T-l 



E[L(Z(T)) - L(Z(0))] + V E E 



t=o 



E ^( D "(*)) 

nEAf 



<TB + TVFZ 



(65) 



Rearranging the terms, using the fact that L(Z(t)) > and 
L(Z(0)) = 0, dividing both sides by FT, and taking the 
limsup as T — *• oo, we get: 



limsup ^^E[f„(D„(i))]<F* 



T->oo 



,T ^ 

«=Af t=0 






(66) 



This shows that our policy satisfies the desired claim. 

It remains to be discussed whether the proposed policy 
does indeed minimize (61). It can be seen, similar to [7] that 
the proposed policy minimizes a modified version of (61) 
where (U n (t) — U m (t)) is replaced by max{ [/„(£) — U m (t) — 
6,0} (cf. (24)). Moreover, when (6 n - E n (t)) < H n (t), we 
harvest a reduced amount of energy. This implies that the 
right-hand side of (61) under the proposed policy is generally 
larger than with the policy discussed above that minimizes 
the right-hand side of (61). However, the loss is at most 

< E E Hn,m(t)5 < N8l max fJ. max (67) 

n£j\T m: (n,m)GC 

for the power allocation part of the algorithm, and 

- E n (t))(H n (t) - (6 



o<E( 

neM 



NH 2 

?n " £n(*))) < — f 21 

(68) 

for the energy harvesting. This shows that (66) also holds for 
the proposed policy as long as we substitute B with B. This 
concludes the proof. ■ 

Appendix D 
Proof of Lemma C. 1 

Proof: Let A* and v* be an optimal solution of the dual 
problem (50), and r* = [r*, . . . , r* N ] and d* = [d*, . . . , d* N ] 



be an optimal solution of the (primal) problem (23). Exis- 
tence of (r*,d*) and (A*,v*) is guaranteed by Weierstrass 
theorem [36, Proposition 2.1.1] and by Slater's condition [36, 
Proposition 3.5.4, part a)]. By [36, Proposition 6.1.1], the 
following conditions must be satisfied by d* and (A*,v*): 
primal feasibility, namely d* < D max , and the comple- 
mentary slackness conditions w*(<i* — D mffi ) = for all 
n e Af, and (r*,d*) = argmin £(r,d, A*, v*) where the 
minimization is taken under the constraints d n > D m - ln and 
< r* < i? max for all n e N. From (48), the given 
conditions imply that 



Vi n {D max ) - log(Anax) J2 K, 



m: n€.X n 



jw„K)-io g «) E x *m)>o, 



(69) 



must be satisfied. This is because the Lagrangian 
£(r, d, X*,v*) when evaluated at d n = rf* should be no 
larger than for d n = D mllx . We thus have the inequalities 



E K, 



< 



v 



< 



log«/£> max ) 

fn(d n ) - in{D tt 



sup 

1 <d„<D„ 



log(d„/£> max ) 



V = ln V, 



where the second inequalities follows since -D m j n < d^ < 
I) max and the third from the definition of j n . ■ 

Appendix E 
Proof of Lemma C.2 

Proof: First, let us consider the time evolution of the 
data queue U n (t) of a generic node n. By squaring both sides 
of (13) and using the fact that for any i£l, (max(x, 0)) 2 < 
x 2 , we have: 

(U n (t + l)) 2 - (U n (t)f = (max(C7„(i) - /*„,*(*). 0) 
+H*, n (t) + R n (t)f - (U n (t)f 

< (Mn,*(*)) 2 + (M*,n(*) + Rn(t)f - 2,U„,* (t)(^,„(t) 

+R n (t)) + 2U n (t)(-n n ,.(t) + A*.,„(t) + R n {t)) 

<(fr,,(t)) 2 + (^(t)+fin(t)) 2 

+2U n (t)(-HnAt) + M.,n(t) + Rn{t)). (70) 



By defining B v = /U max (/Wx + R max ) + # max /2, we then 
see that: 

l -[(U n {t + l)f + {U n {t)f] 

<B V + U n (t)[-ll n ,*{t) + /*„,„(*) + R n {t)}. (71) 

Similarly, let us consider the perturbed evolution of the 
energy queue E n (t). By squaring both sides of (12) we have: 

(E n (t + 1) - e n f - (E n (t) - e n f =_ 
= (E n {t) - P°{R n {t)) - P n {t) + H n {t) - e n f 

- (E n (t) - 6 n f 

= (-P, c l (R n (t))-P n (t)+H n (t)) 2 

+2(E n (t) - n )(-PZ(R n (t)) - P n {t) + H n {t)). (72) 

By defining B E = ±(#Lx + "n^iax + C + 
2a n i?ma X fma X ), we then see that: 

i[(£„(t + l)-#„) 2 -(£ n (t)-0 n ) 2 ] 

< B E + (E n (t) - 6 n )(-pc(R n (t)) - P n (t) + H n (t)). (73) 

Now by summing (71) and (73) over all n e TV, 
and by defining B = N{B V + B E ) = 

N (/Jmax(/*max + -Rmax) + -Rmax/2) + N/2(H max + 
"n-^max + ^max + 2a„i? ma x-Pmax), We have: 

L(Z(t + 1)) - L(Z(t)) <B 

N 
+ J2 U n (t)(-Ll n ,*(t) + H*, n (t) + R n {t)) 
n=l 
N 

+ J2(En(t) - 9 n )(-PC(R n (t)) - P n (t) + H n {t)]. (74) 

Taking the expectation on both sides over the random obser- 
vation, channel and energy harvesting and conditioning on 
Z(i), the lemma follows. ■ 

Appendix F 
Source model 

Here we present a simple source model for which we 
determine numerical results in Section VII. Let the source 
signals measured at sensors in M be spatially correlated with 
parameter ui. Since the measurements are Gaussian we can 
write for the zth sensor Xi = y/uJA + \/l — uiBi, with A and 
Bi independent Gaussian random variables with zero mean 
and unitary variance. Moreover, we assume that the sink is 
able to measure A with an accuracy that depends on the rate 
Rd used for acquisition. From standard rate-distortion theory, 
we have the relationship R^ = I(A; Y), where Y is the side 
information available at the sink. By choosing the optimal 
test channel Y = y/uJdA + \/l — uidC, where ujd denotes the 
correlation between the measurement Y at the sink and A 
(see, e.g., [5]), we obtain the equations reported in the text. 



